Rank in Wordlist | Frequency | Word |
---|---|---|
1931 | 139 | 10,00 |
1975 | 136 | 000,00 |
4310 | 51 | 17,00 |
6503 | 29 | 1,5 |
6846 | 27 | 15,00 |
7020 | 26 | 1,3 |
7233 | 25 | 2,5 |
7237 | 25 | 3,5 |
7440 | 24 | 1,6 |
7442 | 24 | 17,30 |
Rank in Wordlist | Frequency | Word |
---|---|---|
9817 | 16 | IPC(A |
9818 | 16 | IPC(B |
16104 | 7 | IPC(Geral |
17044 | 7 | projectos)(1 |
19754 | 5 | Registo(s |
23294 | 4 | dB(A |
29205 | 3 | o(s |
30299 | 3 | 《ETAPM》(Consulte |
32299 | 2 | Copyright(C |
32481 | 2 | Direito(Ciências |
Rank in Wordlist | Frequency | Word |
---|---|---|
17044 | 7 | projectos)(1 |
17664 | 6 | MIECF)» |
17667 | 6 | Macau)Apoio |
24228 | 4 | patronal)Despesas |
26015 | 3 | Fase)» |
26493 | 3 | P.A.). |
27699 | 3 | concurso)1 |
29508 | 3 | projecto). |
30689 | 2 | 1999-2000)Lei |
31524 | 2 | 9002501375). |
Rank in Wordlist | Frequency | Word |
---|---|---|
2935 | 84 | 10% |
3512 | 67 | 5% |
4203 | 53 | 20% |
4255 | 52 | 50% |
5936 | 33 | 4% |
6075 | 32 | 30% |
6202 | 31 | 15% |
6203 | 31 | 3% |
6670 | 28 | 40% |
6847 | 27 | 7% |
Rank in Wordlist | Frequency | Word |
---|---|---|
21916 | 4 | C&C |
37231 | 2 | está |
38306 | 2 | já |
38800 | 2 | não |
39354 | 2 | possível |
49030 | 1 | Automóveis |
49083 | 1 | B&C |
50554 | 1 | Comissão |
50943 | 1 | Cow&Gate |
51797 | 1 | Dão&Douro |
Rank in Wordlist | Frequency | Word |
---|---|---|
7269 | 25 | HK$ |
12264 | 11 | $1 |
16088 | 7 | HKD$1.000,00 |
16089 | 7 | HKD$10.000,00 |
17698 | 6 | Mop$ |
22323 | 4 | MOP$ |
22324 | 4 | MOP$500 |
24921 | 3 | $10 |
24922 | 3 | $100,00 |
26318 | 3 | MOP$500,00 |
Rank in Wordlist | Frequency | Word |
---|---|---|
7756 | 23 | d'Assumpção |
11714 | 12 | D'Assumpção |
21800 | 4 | A'S |
22617 | 4 | T'oi |
35404 | 2 | atribu'do |
45570 | 1 | 4'50 |
48001 | 1 | A's |
49543 | 1 | Brick's |
49730 | 1 | CD's |
51097 | 1 | D'OURO |
Rank in Wordlist | Frequency | Word |
---|---|---|
4375 | 50 | 2013/2014 |
4501 | 49 | r/c |
4682 | 46 | http://www |
5618 | 36 | e/ou |
6506 | 29 | 2011/2012 |
7417 | 25 | reuniões/conferências |
8162 | 21 | 2014/2015 |
8767 | 19 | 2008/2009 |
9451 | 17 | Macau/GGRAsia |
9987 | 16 | exposições/exibições |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots